Thomson's formulation of the second law: 
an exact theorem and limits of its validity 
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Abstract. Thomson's formulation of the second law - no work can be extracted from a system 
coupled to a bath through a cyclic process - is believed to be a fundamental principle of nature. 
For the equilibrium situation a simple proof is presented, valid for macroscopic sources of work. 
Thomson's formulation gets limited when the source of work is mesoscopic, i.e. when its number 
of degrees of freedom is large but finite. Here work-extraction from a single equilibrium thermal 
bath is possible when its temperature is large enough. This result is illustrated by means of exactly 
solvable models. Finally we consider the Clausius principle: heat goes from high to low temperature. 
A theorem and some simple consequences for this statement are pointed out. 



Introduction 

Since one century the laws of thermodynamics no longer appear as basic principles, 
but as consequences of the laws of matter at the microscopic level. Their derivation from 
first principles requires an identification of concepts such as temperature, work or heat 
in terms of the elementary constituents, and an elimination of the irrelevant microscopic 
variables by means of methods of statistical mechanics. This program has been partially 
achieved for macroscopic systems in the thermodynamic limit [Jl], |2|]. Here we focus on 
Thomson's (later Lord Kelvin) formulation of the second law for equilibrium systems. 
We have two basic goals. The first is to discuss a rigorous version of this formulation 
and to present a simple proof for it. This version uses a minimal number of assumptions 
and has a larger region of applicability as compared to the text-book presentation of 
Thomson's law [Jl], |2|L where it is typically derived from the law of entropy increase. 
Our second goal is to determine mesoscopic limits of applicability of the formulation. 
They arise due to the existence of large but not very large number of degrees of freedom 
(mesocopicity). The special importance of Thomson's formulation comes from the fact 
that other formulations rely more explicitly upon notions such as entropy or temperature 
which require rather elaborate identification in terms of microscopic variables, whereas 
this one concentrates on the concept of work (energy exchange) which can be considered 
as a primary, microscopic concept, as we show below. 



General setup for Thomson's formulation 



In its original form Thomson's formulation states that if a thermodynamic system S, 
which interacts with a single macroscopic thermal bath B at uniform temperature T and 
with a source of work W, evolves in a closed cycle, it cannot yield work to the latter 
external system W. In other words, such a process cannot extract energy from the heat 
bath. 

We assume that interaction of B with S is sufficiently weak, yielding a negligible 
contribution to the energy. (Later in this section we will comment on the situation with 
strong coupling with the bath). Over sufficiently long times, energy transfer identified 
with heat may result from this interaction. The bath can therefore set up a canonical 
equilibrium at the temperature T for S, without interacting directly with the source of 
work. 

In order to separate from each other the effects of the interactions of S with B and with 
W, we shall consider in the following a cyclic process that takes place in two steps. A 
canonical thermal equilibrium of S is assumed to have already been established for times 
t < 0. The interaction process that couples S and W takes place between the times t = 
and t = X. The delay x is supposed to be sufficiently short so that no heat is exchanged 
with the bath during this first step of the process. This is possible due to a weak coupling 
between the system S and the bath B. During the second step, for t > X, the energy 
of W is left unchanged, while S returns to its original thermal equilibrium state at 
temperature T through interaction with the bath B. Altogether, if the cycle is closed 
as regards the system S, a positive amount of energy should according to Thomson have 
been transferred from W to S between the times t = and t = X, and this amount should 
be released from S to B for t > x. This two-step analysis of the process will allow us to 
focus first on the coupled evolution of S and W in the time-interval < t < x, and to 
postpone the forthcoming thermal exchange with the bath. 

More precisely, in order to define the work done by the source W, we express the 
Hamiltonian which generates the evolution in this time-interval as 

H = H s +Hj + H Wl (1) 

where H s (H w ) depends only on the variables of S (W), with [H S ,H W ] = 0, and where 
Hj describes the coupling between S and W. To fix ideas we shall first take Hj in the 

form: Hj = z| . The observable X belongs to the system S, and corresponds to a "force" 
variable such as the pressure for a gas or the magnetic moment for a magnetic material. 
The conjugate variable £ refers to the system W, and corresponds to a "position" variable 
such as the volume for a gas or the magnetic field for a magnetic source. Work done by 
S on W till a moment is identified as the energy yielded by the source W, or equivalently 
as the decrease of the expectation value H w of its Hamiltonian 0]: W = (# w ) — (HJ ( . 

Thomson's formulation for macroscopic sources 

In the thermodynamic limit for the system W (but not necessarily for the system S), 
the fluctuations in £ are negligible at all relevant times; in relative value they are small 



as the inverse square root of the number of microscopic degrees of freedom of W. The 
operator fc, can therefore be replaced by its expectation value (g) t = tr[^p w (?)] = 
Pw(0 = tr sP( ? )' where p(t) is the total density matrix of S and W, and where p w (t) is 
the marginal density matrix of W. The evolution of ^ (t) is generated by the Ehrenfest 
equation: Hd^(t)/dt = i( [#,f] ) t . 

More precisely, if the fluctuation (AE, 2 ) t , where At, = ^ — (£} t , is negligible at all 
times, the Schwartz inequality | AZ) t | 2 < (A^ 2 ) t (AZ 2 ) f , where Z is any operator of 
S or W, implies that we can make the replacement of the operator ^ by its expectation 
value (E,) t = E, (t) everywhere, in particular in its Hamiltonian, which thus takes the form 
H = H s (t) +H W = H s +X E, (t) +H W , where an effective time-dependent Hamiltonian 
H s (t) appears for the system S. This form of H implies the factorization of the evolution 
operator, and that of the density operator p(t) = p s (t) <8> Pw(0 at a ^ times, provided 
the initial state at t = is also factorized. Hence in the Schrodinger picture the density 
matrix p s (t) of the system S evolves according to the unitary transformation U generated 
by H s (t). 

The equilibrium condition at time t = provides the initial density operator of S as 

p s (0) = e-^°)/ r /Z, Z = tre- S s(0)/r. (2) 

A cyclic transformation is characterized here by the fact that ^ (t) has returned to its 
initial value at time f = t: £(t) = £(0), H s (r) = H s (0). Owing to the lack of 

fluctuations in f , this condition is sufficient to ensure that the overall process can be 
rigorously cyclic at the microscopic level. Indeed, since the state of S is not correlated 
at the time t = x with that of W, it can return later on to its original form (Eh at t = 
through interaction with the thermal bath B. The work given by W to S between the 
times and % is equal due to: W = tr[p(x)H s (0) -p s (0)// s (0)]. With help of (EJ) one 
gets, using the fact that U is unitary: 

W 

- = tr[p s (T)lnp s (T)-p s (T)lnp s (0)]>0. (3) 

This expression is nothing but the relative entropy of P s (t) with respect to p s (0); it is 
strictly positive if p s (r) ^ p s (0) 0. 

As shown by (|3|), the work is positive for a macroscopic source of work W, whether 
the system S is macroscopic or microscopic. Thus, in the second case this entails an 
extension of the original Thomson's statement [Q, |5p, which was verified in specific 
examples |7p. The irreversibility of the process arises from the lack of symmetry 
between the times and t which is introduced by the initial condition (EJ). Usually, the 
breaking of time-reversal-invariance is associated with a dissipative process. Here the 
latter process is the interaction with the heat bath, which will take place after the time T. 

The above treatment can readily be generalized to an interaction with several (not 
necessarily independent) sources of work, where Hj = ELi^/S- Altogether, if the 
source W of work is a macroscopic system, the lack of fluctuations in the variables 

which couple W to the system S, ensures the second law in Thomson's form as soon 
as S is initially in a canonical equilibrium state. 



The fate of Thomson's formulation for mesoscopic sources. 



Let us now turn to the situation where W is no longer a macroscopic object. We 
have to deal with the operator nature of the <^-'s and we can no longer leave aside the 
Hamiltonian H w of the source of work, when studying the dynamics of S. 

Work is still defined as the decrease of the energy of the source. We will again assume 
that in its initial state the system S is in canonical thermal equilibrium, and is not 
correlated with W: p(0) = p s (0) <g> p w (0), where p s (0) is given by @ with H s (0) = 

H s +L/=i-A;(£/)o- A detailed discussion on the necessity of this initial conditions is 
given in However, the definition of a cyclic process is no longer unambiguous. This 
condition may indeed be generalized in two different ways, both of which are rather 
natural [|]] . To keep our presentation concise we will here outline only one of the two 
definitions proposed in [Jsp. Since the variables X i and q t can now be correlated, and 
since the interaction term H { involves a sum of their products, we search a condition 
for such a sum of products. We notice that in the considered mesoscopic case the states 
of S and W do become correlated for t > 0. Since now both pit) and H l irreducibly 
live in the common Hilbert space of S and W, the most standard way to define energy 
exchange between two systems is to require that the average interaction Hamiltonian 
does not contribute to the total energy budget, i.e., it is zero both initially and finally. For 

our situation the proper interaction Hamiltonian is L;=i^Q(^ ~~ (£i)o)' smce i ts average 
is zero at the initial time t = 0. Requiring that this average is also equal to zero at some 
final time T, we get for a cyclic process with the duration x: 

i(X 1 i i )r = t(X i )r(i i ) . (4) 
(=1 (=1 

For a large system W we recover the macroscopic condition for r = 1, since the lack of 
fluctuations in t, implies the lack of correlations withX: (X%) t ~ (X) t (q) t . 

Work in the short-time limit. We shall discuss for (Q) the sign of the work integrated 
over the duration x of a closed cycle, in the limit where x is sufficiently small so that 
we can expand W in powers of x. This limit is especially interesting, since mesoscopic 
contributions to the work can overcome the main thermodynamic term: as we shown [§], 
the thermodynamic contribution to the work, evaluated by neglecting the fluctuations of 
<^-'s and given by @, scales as x for small x, whereas in the mesoscopic case W ~ X . 
Expanding the Heisenberg equations of motion one gets for the work [||]: 

= t ({A| i -,A|j + )£(p a - jPj8 )(^-^)9l{(aft| J 8}( J 8|Z,ia)} (5) 

i,k=l a</3 

+i t <M> I(Pa + ^)(^-M3{(aft|j8)(j8|Z ( .|a)}, (6) 

i,k=l a<p 

where {...,... } + denotes anti-commutator, 3i and 3 stand for the real and imaginary 
parts, respectively. {\oc)} is the common eigenbase of p s (0) and H s (0), and p a , h a 
are the corresponding eigenvalues: p a = e~^ /% /£ a e~^ /% . For r = 1, where only the 




term @ survives, one can show that W > because p s (0) is a decreasing function of 
H s . Since the thermodynamic contribution to the work is negligible small, the amount of 
dissipated work predicted by the second law, is enhanced in the mesoscopic domain. The 
contribution (B) reflects an effect of interference between different types of work (chan- 
nels), since it contains non-diagonal terms over i,k and becomes zero with ( [<^,,<§ ; ] )• 
Now assume that the temperature of the system S is so high (fluctuations are strong) 
that p a ~ pp for all the /3 and a that are connected by matrix elements of X k and X { . 

Then the contribution of @ can be disregarded, and provided that ( [1^., f ; ] ) 7^ and 
3{ (oc\X k |j8)(j8 |X-|a) } 7^ 0, the sign of @ can be made negative. Thus the many-source 
situation can lead to a violation of the second law. The above results also apply to the 
classical limit, where Poisson brackets take the place of commutators. 

Let us then examine the thermalization process which takes place for t > X. It is 
straightforward to show that due to the taken definition of cyclic processes, marginal 
density operator of S is the equilibrium distribution @>, both at the preparation time 
t = and after the interaction with the thermal bath that takes place for t > x. 

Exactly solvable models. The result above on the violation and enhancement of the 
Thomson's formulation of the second law do not use specific properties of the involved 
systems S and W, but unfortunately are essentially restricted to short times. Moderate 
and long times can be explored by help of exactly solvable models for S and W. To 
this end we investigated two such models [|| |^]. The first of them is Jaynes-Cummings 
model describing interaction of two-level atom or spin and a single mode of radiation 
[[Kip. As for our purposes, we can naturally identify the two-level atom as the system S, 
and the electromagnetic field - which is controlled externally and admits a well-defined 
macroscopic limit - as the source W. The Hamiltonian of the model reads 

H s = ^Hcod z , H w =Hcod^d, H I =Hg(d + d + d_d^), (7) 

where a (d^) is the annihilation (creation) operators of the field, d x , a y , d z , d + = 
(& x ± io y )/2 are the Pauli matrices of the spin, and g > is the coupling constant. The 
initial state of the spin is thermal with temperature T and that of the mode is described 
by a coherent state \a) with an amplitude a. With help of the known exact solution 
of this model we found some new mechanisms of the Thomson's law violation in the 
mesoscopic regime; for a detailed discussion see [§]. 

In the second exactly solvable model [^p the systems S and W are two ensembles of 
/V independent classical harmonic oscillators, coupled one-to-one via their coordinates 
x and y. Like in an ideal gas, one may describe only one set of them, with Hamiltonians 

H s = l -mx 2 + l -m(0 2 x 2 , H w = X -My 2 + ^MQ. 2 y 2 , H I = -gxy, (8) 

The jc-oscillator, associated with S, starts its evolution from a thermal state at a tem- 
perature T . The y-oscillator starts from a macroscopic state with a well-defined initial 
coordinate y(0) and velocity y(0). It plays the role of the work-source W. This model not 
only allows to discuss limits of the second law for one closed cycle, but actually admits 
rather detailed many-cycle considerations [0. We determined conditions under which 



an infinity of cycles with constant yield is possible. This is a mesoscopic 'perpetuum 
mobile' that may cool the macroscopic bath B [^p. 



Clausius principle and its microscopic foundations. 

Finally, we shortly analyze the Clausius principle: heat goes from higher temperature 
to lower one. To put this statement on the microscopic ground, one considers two 
interacting systems 1 and 2 with the total Hamiltonian [|T], ITT], |T2[ ]: H tot = H i + 
H 2 + Hp where H l (H 2 ) is the Hamiltonian of the subsystem 1 (2), and H l is the 
interaction Hamiltonian. The initial state consists of two equilibrium subsystems at 
different temperatures, p tot (0) = p^O) ®p 2 (0), where p k (0) = exp[—H k /T k ]/Z k for 
A 1.2. 

Let us denote AU l = {H x ) t - (fi x ) Q , AU 2 = (H 2 ) t - (H 2 ) Q and AUj = Uj(t) - £7,(0+). 
It was rigorously proven that [JTTJJ: 

All, AU 7 

^r 1 + ^7 1 >0, (9) 

whereas correct but less rigorous arguments towards its validity are provided in [jlj]. 
After the coupling between the two subsystems has been turned on, the total energy is 
conserved. Thus we get from (g): (T 2 — AU l > T^Uj. In case T 2 > T i and AU I > 
this inequality proves the Clausius principle. If Hj commutes with H l and H 2 , Uj is 
conserved, implying that (T 2 — Tj) Af/j > 0, as proven by Reents [JT2|] . 

When T 2 > T x but AUj < there may, in principle, be a flow from low to high 
temperature, but it is bounded if AUj is bounded. In particular, such a current cannot 
be constant and everlasting. 
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